Discrimination and Retrieval of Environmental Sounds
Abstract
Hearing may be regarded as the second most important human sense after sight. This ranking is reflected in the field of information retrieval, where until recently research concentrated on visual information retrieval. Even research in audio retrieval (AR) focused on a single aspect of hearing, namely the understanding of speech. With the emergence of large music databases in recent years, a second area of AR gained importance: music information retrieval (MIR). The goal of MIR is to enable efficient search and retrieval in these music databases. The most recent research area in audio retrieval is the retrieval of environmental sounds. One may argue that environmental sound retrieval deserves a more prominent role than it currently has: most sounds humans hear are neither speech nor music but various environmental sounds. By incorporating environmental sounds into retrieval systems, a vast amount of additional information becomes available. This thesis investigates the applicability of a range of audio features to environmental sound retrieval. Furthermore, state-of-the-art techniques in audio retrieval are identified through a broad survey of the relevant literature covering all three areas of AR (speech, music, and environmental sounds). The quality of the features is examined with three different classification techniques. Finally, a set of novel audio features, developed by the author, is compared to established features. Results indicate that further research is necessary. In particular, there is a lack of low-dimensional and computationally cheap audio descriptors suitable for use in environmental sound retrieval.
Similar resources
Vibrotactile Identification of Signal-Processed Sounds from Environmental Events Presented by a Portable Vibrator: A Laboratory Study
Objectives: To evaluate different signal-processing algorithms for tactile identification of environmental sounds in a monitoring aid for the deafblind. Two men and three women, sensorineurally deaf or profoundly hearing impaired with experience of vibratory experiments, age 22-36 years. Methods: A closed set of 45 representative environmental sounds was processed using two transposing (TRH...
Robust discrimination between EEG responses to categories of environmental sounds in early coma
Humans can recognize categories of environmental sounds, including vocalizations produced by humans and animals and the sounds of man-made objects. Most neuroimaging investigations of environmental sound discrimination have studied subjects while consciously perceiving and often explicitly recognizing the stimuli. Consequently, it remains unclear to what extent auditory object processing occurs...
Pathologies cardiac discrimination using the Fast Fourier Transform (FFT), the short time Fourier transform (STFT) and the Wigner distribution (WD)
This paper presents a synthesis study of the fast Fourier transform (FFT), the short-time Fourier transform (STFT), and the Wigner distribution (WD) for analysing the phonocardiogram (PCG) signal, that is, heart sounds. The FFT can provide a basic understanding of the frequency content of the heart sounds. The STFT is obtained by calculating the Fourier tran...
Medial prefrontal cortex circuit function during retrieval and extinction of associative learning under anesthesia
Associative learning is encoded under anesthesia and involves the medial prefrontal cortex (mPFC). Neuronal activity in mPFC increases in response to a conditioned stimulus (CS+) previously paired with an unconditioned stimulus (US) but not during presentation of an unpaired stimulus (CS-) in anesthetized animals. Studies in conscious animals have shown dissociable roles for different mPFC subr...
Modeling storage and retrieval of memories in the brain
We have proposed a neural network model that stores the incoming information after orthogonalizing it in the same manner as vectors are orthogonalized. The scheme enables the brain to compare a new informational system with those in the memory and store its similarities and differences with the old memories in an economical manner. This allows the brain to have an enormous capacity and yet the ...
Publication date: 2005